Assessing Text Semantic Similarity Using Ontology

نویسندگان

  • Hongzhe Liu
  • Pengfei Wang
چکیده

Sentence and document similarity assessment is key to most NLP applications. This paper presents a novel measure of calculating the similarity between sentences or between documents using ontology. The similarity is assessed using sentence or document concept vector forming from finding the linkage between ontology terms and sentence or document content, the linage can be used to generate semantic indexes of sentences or document and apply them to implement highly efficient searching algorithms to compute sentence or document similarity, and the difference between the sentence and document similarity measurement is articulated. Results were verified through experiments. Experiments show that this technique is efficient and compares favorably to other similarity measures, and it is flexible enough to allow the user to make comparisons without any additional dictionary or corpus information. We believe that this method can be applied in a variety of text knowledge representation and discovery applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A method for ontology-based semantic relatedness measurement

There are many methods having different approaches for assessing similarity and relatedness and they are used in many application areas, including web service discovery, invocation and composition, word sense disambiguation, information retrieval, ontology alignment and merging, document clustering, and short answer grading. These methods can be categorized as path-based, information content-ba...

متن کامل

Toponym Disambiguation Using Ontology-Based Semantic Similarity

We propose a new heuristic for toponym sense disambiguation, to be used when mapping toponyms in text to ontology concepts, using techniques based on semantic similarity measures. We evaluated the proposed approach using a collection of Portuguese news articles from which the geographic entity names were extracted and then manually mapped to concepts in a geospatial ontology covering the territ...

متن کامل

Learning Semantic Relatedness from Human Feedback Using Relative Relatedness Learning

An important topic in Semantic Web research is to learn ontologies from text. Here, assessing the degree of semantic relatedness between words is an important task. However, many existing relatedness measures only encode information contained in the underlying corpus and thus do not directly model human intuition. To solve this, we propose RRL (Relative Relatedness Learning) to improve existing...

متن کامل

Limits of Lexical Semantic Relatedness with Ontology-based Conceptual Vectors

Conceptual vectors can be used to represent thematic aspects of text segments, which allow for the computation of semantic relatedness. We study the behavior of conceptual vectors based on an ontology by comparing the results to the Miller-Charles benchmark. We discuss the limits to such an approach due to explicit mapping, as well as the viability of the Miller-Charles dataset as a benchmark f...

متن کامل

Improving Semantic Similarity for Pairs of Short Biomedical Texts with Concept Definitions and Ontology Structure

Finding semantic similarity between short biomedical texts, such as article abstracts or experiment descriptions, may provide important information for health researchers. This paper presents a method for calculating text similarity in the biomedical context. The method implements a pairwise concept semantic similarity measure that uses concept definitions and ontology structure. The respective...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JSW

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2014